Automatic 3D face synthesis using single 2D video frame - Electronics Letters
نویسنده
چکیده
Introduction: Automatic 3D face synthesis plays a crucial role in many applications. One example is real-time 3D model-based video coding. Here, a generic 3D face model should ideally be adapted automatically to a human face in the first video frame. Additionally, facial texture should be acquired without supervision. However, the time-consuming 3D scanner [1] is not accepted for such an automatic system. The method of using two orthogonal photos [2] cannot provide real-time processing either. Feng [3] synthesised the face only from a single image, but this method needs to estimate head rotation parameters by using another reference image. Moreover, it hardly brings accurate results in 3D face model adaptation by only using three facial features. Strictly speaking, there is no fully automatic 3D face synthesis from a single image proposed to make such a real-time video application a reality. This Letter presents an automatic synthesis system, enabling 3D face modelling from an arbitrary 2D header-and-shoulder video frame without partial occlusion.
منابع مشابه
Multimodal Translation System Using Texture-Mapped Lip-Sync Images for Video Mail and Automatic Dubbing Applications
We introduce a multimodal English-to-Japanese and Japanese-to-English translation system that also translates the speaker’s speech motion by synchronizing it to the translated speech. This system also introduces both a face synthesis technique that can generate any viseme lip shape and a face tracking technique that can estimate the original position and rotation of a speaker’s face in an image...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کاملDense 3D face alignment from 2D video for real-time use
To enable real-time, person-independent 3D registration from 2D video, we developed a 3D cascade regression approach in which facial landmarks remain invariant across pose over a range of approximately 60 degrees. From a single 2D image of a person’s face, a dense 3D shape is registered in real time for each frame. The algorithm utilizes a fast cascade regression framework trained on high-resol...
متن کامل3D Generic Elastic Models for 2D Pose Synthesis and Face Recognition
Pose, illumination, expression and the generalization of such effects to unseen face data samples are the fundamental problems faced in face recognition. The significant contribution of this thesis is the ability to match any two face images with a large pose angle variation. This approach utilizes a proposed 3D prior face model in order to cover a wide range of poses. To achieve this, a rapid ...
متن کاملVisual speech synthesis from 3D video
Data-driven approaches to 2D facial animation from video have achieved highly realistic results. In this paper we introduce a process for visual speech synthesis from 3D video capture to reproduce the dynamics of 3D face shape and appearance. Animation from real speech is performed by path optimisation over a graph representation of phonetically segmented captured 3D video. A novel similarity m...
متن کامل